Binaural Cepstrum Coefficient and Its Application to Ground Target Recognition
نویسندگان
چکیده
Stereausis is a biologically motivated model proposed by Shamma which encodes both binaural and spectral information in a unified framework to simulate the processing of human binaural auditory system. In this paper, a new type of cepstrum coefficient is proposed based on this model. Two-channel acoustic signals are first processed by the stereausis binaural model to synthesize the spectral information and reduce the interference of noise signal. The binaural cepstrum coefficient is then extracted based on the diagonal vector of the stereausis model's output pattern, and is applied as feature to the multi-class acoustic target recognition. Learning Vector Quantization (LVQ) algorithm is implemented as the classifier and is tested by samples of vehicle acoustic signals. Experimental results show that binaural cepstrum coefficient improves both the performance and generalization of the classifier, especially at low SNR.
منابع مشابه
Zero-Crossing-Based Channel Attentive Weighting of Cepstral Features for Robust Speech Recognition: The ETRI 2011 CHiME Challenge System
We present a practical and noise-robust speech recognition system which estimates a target-to-interferers power ratio using a zero-crossing-based binaural model and applies the power ratio to a channel attentive missing feature decoder in the cepstral domain. In a natural multisource environment, our binaural model extracts spatial cues at each zero-crossing of a filterbank output signal to loc...
متن کاملClassification Of Ground Moving Object Using Coefficient Of Integrated Bispectrum For Doppler Radar
This paper considers the classification of radar target using Backscatter Doppler signature of moving object. Classification performance evaluated by the integrated Bispectrum based technique of feature extraction and compared it with Cepstrum based feature extraction technique. Classifier performance is tested by GMM (Gaussian Mixture Model) and ML (Maximal Likelihood) decision making method. ...
متن کاملMFCC and its applications in speaker recognition
Speech processing is emerged as one of the important application area of digital signal processing. Various fields for research in speech processing are speech recognition, speaker recognition, speech synthesis, speech coding etc. The objective of automatic speaker recognition is to extract, characterize and recognize the information about speaker identity. Feature extraction is the first step ...
متن کاملApproaches for Automatic Speaker Recognition in a Binaural Humanoid Context
This paper presents two methods of Automatic Speaker Recognition (ASkR). ASkR has been largely studied in the last decades, but in most cases in mono-microphone or microphone array contexts. Our systems are placed in a binaural humanoid context where the signals captured by both ears of a humanoid robot will be exploited to perform the ASkR. Both methods use Mel-Frequency Cepstral Coding (MFCC)...
متن کاملAcoustic Scene Classification Using Spatial Features
Due to various factors, the vast majority of the research in the field of Acoustic Scene Classification has used monaural or binaural datasets. This paper introduces EigenScape a new dataset of 4th-order Ambisonic acoustic scene recordings and presents preliminary analysis of this dataset. The data is classified using a standard Mel-Frequency Cepstral Coefficient Gaussian Mixture Model system, ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007